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A big open question in the quantum information theory concerns feasibihty of a self- 
correcting quantum memory. A quantum state recorded in such memory can be stored 
reliably for a macroscopic time without need for active error correction if the memory is put 
in contact with a cold enough thermal bath. In this paper we derive a rigorous lower bound on 
the memory time Tmem of the 3D Cubic Code model which was recently conjectured to have 
a self-correcting behavior. Assuming that dynamics of the memory system can be described 
by a Markovian master equation of Davies form, we prove that T„iem > L"^^ for some constant 
c > 0, where L is the lattice size and /3 is the inverse temperature of the bath. However, this 
bound applies only if the lattice size does not exceed certain critical value L* ^ e^^^. We 
also report a numerical Monte Carlo simulation of the studied memory indicating that our 
analytic bounds on Tmem are tight xvp to constant coefficients. In order to model the readout 
step we introduce a new decoding algorithm which might be of independent interest. Our 
decoder can be implemented efficiently for any topological stabilizer code and has a constant 
error threshold under random uncorrelated errors. 

PACS numbers: 03.67.Pp, 03.67.Ac, 03.65.Ud 
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I. INTRODUCTION 

Any practical memory device must function reliably in a presence of small hardware imperfec- 
tions and protect the recorded data against thermal noise. Building a memory capable of storing 
quantum information is particularly challenging since a quantum state must be protected against 
both bit-flip and phase-flip errors. Furthermore, in contrast to classical bits, quantum states form 
a continuous set thus being more vulnerable to small hardware imperfections. 

Ground states of topologically ordered many-body systems, such as fractional quantum Hall 
liquids |30tl37j and unpaired Majorana fermions in nanowires and 2D heterostructures [U [201 [391 HU 
[33] were proposed as a natural quantum data repository insensitive to local imperfections. A qubit 
encoded into the ground subspace of a topologically ordered system is almost perfectly decoupled 
from any local perturbation due to local indistinguishability of the ground states [H [H [331 [36] . 

To undo the effect of noise, a user of any memory, either classical or quantum, must invoke some 
form of error correction. It was shown by Dennis et al p^8j that topological memory based on the 
Kitaev's 2D toric code model [34J can tolerate stochastic local noise provided that error correction 
is performed frequently enough to prevent errors from accumulating. However, such active error 
correction would require an extensive and fast classical input/output to the quantum hardware 
which might pose a challenge for building large-scale memory devices. 

An intriguing open question raised in [5l [18] is whether topological memories can be self- 
correcting, that is, whether active error correction can be imitated by the natural dynamics of 
the memory system coupled to a thermal bath. The physical mechanism behind self-correction 
envisioned in [5] relies on a presence of "energy barriers" separating distinct ground states and 
energy dissipation. If the energy barriers are high enough and the evolution time is not too long, 
the memory system will be locked in the "energy valley" surrounding the initial ground state. A 
user of a self-correcting memory would only be responsible for preparation of the initial ground 
state and performing one final round of active error correction at the readout step. The latter 
is required to clean up the residual low-energy errors. In contrast to the active error correction 
approach of [18], the storage itself would require no action from the user whatsoever. 

The nature of excitations in a topological memory plays a crucial role in assessing its self- 
correcting capability. Anyons in the 2D toric code [M] provide a paradigmatic example of such 
excitations. A fundamental flaw of 2D topological memories that rules out self-correction is the lack 
of energy barriers that could suppress diffusion of anyons over large distances [18j . For instance, 
consider a process that involves a creation of anyon pair from the ground state, a transport of one 



4 



anyon along a non-contractible loop on the torus, and a final annihilation of the pair. This process 
enacts a non-trivial transformation on the ground subspace. However, it can be implemented by a 
stream of local errors at a constant energy cost. The lack of self-correction for the 2D toric code 
model was rigorously confirmed by Alicki et al [2] who showed that the relaxation time towards 
the equilibrium state is a constant independent of the lattice size. A more general no-go result for 
quantum self-correction based on arbitrary 2D stabilizer code Hamiltonians was derived in \12\ [32] • 

While anyons have a rich algebraic structure that depends on a particular model, their common 
property is that anyons are topological defects. More broadly, a topological defect is a point-like 
excitation that cannot be created from the ground state by a local operator without creating other 
excitations. A domain wall in the ID ferromagnetic Ising chain provides the simplest example of a 
topological defect. The second property that applies to all models with anyonic excitations is that 
anyons are mobile topological defects. In other words, once an anyon has been created at some 
spatial location, a local operator can move it to another location nearby without creating any other 
excitations. As we argued above, the latter property is the key reason why self-correction is hard, if 
not impossible, to achieve in 2D. It raises a question whether the laws of physics permit topological 
phases of matter without mobile topological defects ? (In the coding language it translates to 
existence of topological codes without string- like logical operators.) Quite surprisingly, the answer 
turns out to be yes. A family of 3D topological memory models without mobile topological defects 
was recently discovered by one of the authors [26]. One of these models, which we call the 3D 
Cubic Code, obeys the so called no-strings rule |7]. It establishes a constant upper bound a on 
the mobility range of topological defects. More precisely, given any isolated topological defect 
occupying a spatial region of size R, no local operator can move this defect distance aR or more 
away without creating extra excitations, see |26j for details. A strong evidence in favor of self- 
correcting properties of the 3D Cubic Code has been recently obtained in |7j by proving that the 
energy barrier separating distinct ground states grows as a logarithm of the lattice size. 

The main goal of the present paper is to assess self-correcting properties of the 3D Cubic Code 
in a more direct way by calculating its memory time as a function of the lattice size and the 
temperature. For the sake of concreteness, we shall state our results only for the 3D Cubic Code, 
but our analysis applies to any quantum memory model based on a stabilizer code with local 
generators which obeys the topological order condition and the no-strings rule as stated in [7]. 
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FIG. 1. Stabilizer generators of the 3D Cubic Code. Here X = and Z = a'-^ represent single-qubit Pauli 
operators, while / is the identity operator. Double-letter indices represent two-qubit Pauli operators, for 
example, IZ = I (g) Z, ZZ = Z (g) Z, II = I (g) I etc. 



II. MEMORY HAMILTONIAN, THERMAL NOISE, AND DECODING 



In order to use any memory, either classical or quantum, a user must be able to write, store, 
and read information. In this section we describe these steps formally for a topological quantum 
memory based on the 3D Cubic Code. Definition of the code Hamiltonian and its basic properties 
are summarized in Section III A[ Our formal model of the thermal noise based on Davies weak- 



coupling limit is presented in Section II B The readout step consists of a syndrome measurement 



and an error correction, see Section II C 



A. 3D Cubic Code Hamiltonian 



Let A = T^L X X Z^, be the regular 3D Cubic lattice of linear size L with periodic boundary 
conditions. Each site of the lattice is occupied by two qubits. The 3D Cubic Code Hamiltonian 
introduced in [26J has the following form: 

H = -j'^G^ + Gf , 

c 

where the sum runs over all elementary cubes c and the operators , G^ act on the qubits 
of c as shown on Fig. [l| The positive coupling constant J will be set to J = | for simplicity. 
We shall refer to operators G^ , G^ as stabilizer generators, or simply stabilizers. Note that each 
stabilizer acts non-trivially only on 8 qubits. Let us recall some basic properties of the 3D Cubic 
Code, see [2^ for details. First, one can easily check that the stabilizers G^ , G^, commute with 
each other for all c, c'. A ground state of -fT is a common +1 eigenstate of all stabilizers. The 
degeneracy of the ground states is 2'^^^^ for some integer 2 < k{L) < 4L. The ground subspace 
of H has topological order, that is, different ground states cannot be distinguished locally. More 
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precisely, if O is any operator whose support can be bounded by a cubic box of size < L then the 
restriction of O onto the ground subspace is proportional to the identity operator. Excited states 
of H can be described by configurations of defects, that is, stabilizers whose eigenvalue is —1. Each 
defect costs one unit of energy. 

B. Thermal noise 

Suppose at time t = Q the memory system is initialized in some ground state p{Q) encoding 
a quantum state to be stored. We shall model interaction between the memory system and the 
thermal bath using the Davies weak coupling limit [TT]. It provides a Markovian master equation 
of the following form: 

p{t) = -i[H,p{t)]+C{p{t)), t>0. (1) 

Here p{t) is the state of the memory system at time t and C is the Lindblad generator describing 
dissipation of energy. To define £, let us choose some set of self-adjoint operators {Aa} through 
which the memory can couple to the bath. We shall assume that each acts non-trivially on a 
constant number of qubits. For example, {A^} could be the set of all single-qubit Pauli operators. 
Let Aa = Aa^uji where Aa^u is the spectral component of Aa that maps eigenvectors of H with 
energy E to eigenvectors with energy E — lo. Then 

^^P^ = 12Y1 - lip, ^lu.Aa,co}] . (2) 

The coefficient h{a,uj) is the rate of quantum jumps caused by Aa transferring energy uj from the 
memory to the bath. It must obey the detailed balance condition 

h{a,-io) = e-^'^h{a,uj), (3) 

where /3 is the inverse bath temperature. The detailed balance condition Eq.Q is the only part of 
our model that depends on the bath temperature. It guarantees that the Gibbs state ~ e~^^ 
is the fixed point of the dynamics, C{pj3) = 0. This is a unique fixed point under certain natural 
ergodicity conditions [42j. Furthermore, we shall assume that < 1 and 

max/i(a,cj) = 0(1). (4) 

Let us remark that the Davies weak coupling limit was adopted as a model of the thermal dynamics 
in most of the previous works with a rigorous analysis of quantum self-correction; see for instance O 
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C. Decoding 

The final state p{t) generated by the Davies dynamics can be regarded as a corrupted version of 
the initial encoded state p{0). A decoder retrieves the encoded information from p{t) by performing 
a syndrome measurement and an error correction. A syndrome measurement involves a non- 
destructive eigenvalue measurement of all stabilizer generators , . The measured syndrome 
S can be regarded as a classical bit string that assigns an eigenvalue ±1 to each generator. Let Us 
be the projector onto the subspace with syndrome S. 

The error correction step is specified by an algorithm that takes as input the measured syndrome 
S and returns a correcting Pauli operator Pec{S). A good choice of the error correction algorithm 
is a highly non-trivial task which we discuss in Section |VI[ The net action of the decoder on states 
can be described by a trace preserving completely positive (TPCP) linear map ^ec such that 

^ec{p) = PeciS)Us pUsPec{S)\ (5) 
S 

where the sum runs over all possible syndromes. 

III. MAIN RESULTS AND SKETCH OF THE PROOF 

Our first result is an upper bound on the storage error — the trace distance between the initial 
encoded state and the final error corrected state. 

Theorem 1. There exists a decoder $ec o,nd a constant c > such that for for any inverse 
temperature f3 > 0, any state p{0) supported on the ground subspace of H , any evolution time 
t > 0, and any constant < a < 1 one has 

e{t) = ||p(0) - ^ec{p{t))\\i < 0{t) ■ 2^=^ • /3 (6) 

as long as L < e^^~"'^^^^. The error correction algorithm used by the decoder has running time 
poly{L). 

Clearly, our bound is most useful when k{L) is small. In the following we shall mostly be 
interested in the smallest ground state degeneracy, k{L) = 2. This happens for any odd 3 < L < 200 
such that L is not a multiple of 15 or 63, see |26j . One can also show j25j that there exists an 
infinite sequence of lattice sizes such that k{L) = 2, for example, k(2P + 1) = 2 for all p > 1. 
Unfortunately, a general explicit formula for k{L) is unknown. 
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Linear system size 



FIG. 2. The memory time Tmem vs. the system size L. In the upper inset is shown the exponent of the 
power law fit of Tmem for the first a few system sizes. It is clear that Tmem on ^2.93/3-10. 5 ^ ]^*^ where 

L* is the optimal system size where Tmem reaches maximum. The data for f3 = 4.3, 4.5, 4.7, 4.9, 5.1, 5.25 are 
shown. 



Residual (logr,„„„) 




Inverse Temperature {0=2J/kTi 

FIG. 3. The maximum memory time Tmem vs. the inverse temperature /3. The memory time is maximized 
with respect to the system size. The logarithm of Tmem clearly follows a quadratic relation with /3 as oppose 
to a linear one. 
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The upper bound on the storage error can be easily translated to a lower bound on the memory 
time. Indeed, if one is willing to tolerate a fixed storage error e, say e = 0.01, the memory time 
Tmem Can be defined as the smallest t > such that e{t) > e. Assuming that the lattice size is 
chosen such that k{L) = 2, Theorem [l] implies that 

Tmem>L-'"'-^ ioT ^ny L < e^^'^^^ '\ (7) 

Here we neglected the overall constant coefficient. It shows that for low temperatures, /3 ^ 1, 
and sufficiently small system size, memory time grows polynomially with L, while 

the degree of the polynomial is proportional to /3. To the best of our knowledge, this provides 
the first realistic example of a topological memory with a self-correcting behavior. Unfortunately, 
the studied model has no self-correction in the thermodynamic limit (3 = 0(1) and L — )■ oo. In 
this sense, it is only partially self-correcting. At the optimally chosen lattice size, the maximum 
memory time Tmem{P) achievable at a given temperature /3 is easily found from Eq. ([T]): 

TmeM > (8) 

for /3 ^ 1. For comparison, the memory time of the 2D toric code model grows only exponentially 
with /?, see [21 [16]. Depending on the value of the constant c and the temperatures realizable in 
experiments, the scaling Eq. ^ may be favorable enough to achieve macroscopic memory times. 

The restriction L <^ e^^^ implies that the average number of defects (flipped stabilizers) in 
the equilibrium Gibbs state p/s ~ e^^^ is less than one, that is, the Gibbs state has most of its 
weight on the ground subspace of H. This might suggest that the thermal noise is irrelevant in 
the studied regime. However, this is not the case. If the evolution time is large enough, so that 
p{t) w pp, the encoded information cannot be retrieved from p{t), since pp does not depend on the 
initial state. If a time t ~ e^^'^^^ has elapsed, the system would have accommodated approximately 
tL^e~^ ~ e^^^^-* defects during the evolution. This implies in particular that the system has 
endured e^^^^^ errors which becomes significant for low temperatures. 

Since Theorem [T] only provides a lower bound on the memory time, a natural question is whether 
this bound is tight and, if so, what is the exact value of the constant coefficient c ? To answer this 
question, the memory time of the 3D Cubic Code has been computed numerically for a range of 
/3's and L's. It should be emphasized that both Theorem [l] and our numerical simulation use the 



same decoder at the readout step (see below). Our simulation, described in detail in Section VIII 
provides a Monte Carlo estimate of the successful decoding probability p{t) as a function of the 
evolution time t. The decay of p(i) was observed to follow an exponential law, p{t) = exp(— t/r). 
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The characteristic decay time r was chosen as our numerical estimate of the memory time, that 
is, we set Tmem = T. The scahng of Tmem as a function of L is shown on Fig. [2} It suggests 
that Tmem ~ ^2.93/3-10.52 j^j^g as L < L* ?s gO.78^-0.87^ rpj^g numerical results strongly suggest 
that our analytical bound Eq. Q is tight up to constant coefficients (for the chosen decoder). A 
comparison between the numerical data for Tmem{P) and the analytical bound Eq. Q is shown 
on Fig. [sj The numerics suggests a scaling TmemiP) ^ gi-69/32+o(/3)^ The numerical data were 
obtained for 26 values of (3 in the interval 4.0 < P < 5.25 for each lattice size 5 < L < 33 satisfying 
k{L) = 2 (that is, L is odd and L ^ 15). For each pair {L,f3) the memory time was estimated 
using a few hundred Monte Carlo samples. Details on interpretation of the numerical data and the 



error analysis can be found in Section VIII 



The numerical computation of the memory time has become possible due to our second result, 
which is an efficient error correction algorithm applicable to any topological stabilizer code. We 
call it a Renormalization Group (RG) decoder since it falls into a larger family of error correction 
algorithms in which a syndrome is processed in a hierarchical way using real-space renormalization 
methods |19tl28j. Our algorithm and its rigorous analysis borrow many ideas from [28J. A detailed 



description of the RG decoder and its efficient implementation are described in Section VI Let us 
briefly sketch the main idea of the algorithm. The RG decoding is a sequence of simple subroutines 
parameterized by integer levels p = 0,1, . . . , log2 L. At any given level p, the decoder decomposes 
a syndrome into disjoint connected clusters, where the connectivity is defined using 2^ as a unit 
of length. This step can be implemented in time 0{N), where N is the volume of the lattice. 
The decoder then examines each cluster individually and tries to 'annihilate' it by a local Pauli 
operator. Clusters that cannot be annihilated are passed to the next level (that is, p + 1). The 
time needed to test whether a given connected cluster can be annihilated depends on a particular 
code. For the 3D Cubic Code we show how to perform this test in time 0{V), where V is the 
volume of the smallest rectangular box enclosing the cluster. Once the decoder reaches the highest 
level, p = log2 L, it returns the product of annihilation operators over all clusters that have been 
successfully annihilated. The overall running time of the RG decoder is poly{N) for any topological 
stabilizer code. 

The applicability of the RG decoder is by no means restricted to errors generated by thermal 
noise. A natural question is how well the RG decoder performs against local stochastic noise. In 
Appendix \K\ we numerically benchmark the RG decoder for the 2D toric code, where we consider 
random independent X (or equivalently Z) errors with a uniform rate p. Our numerical estimate 
of the error threshold is Pc ~ 6.7% which is reasonably close to the threshold Pc ~ 11% obtained 
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using the minimum weight matching decoding |18j . Perhaps more importantly, we prove that 
the RG decoder has a constant error threshold against local stochastic errors for any topological 
stabilizer code, see Theorem [2] in Appendix |B} To the best of our knowledge, the RG decoder 
is the first error correction algorithm applicable to any topological stabilizer code that combines 
good practical performance with a rigorous lower bound on the error threshold and polynomial 
worst-case running time. 

Theorem [Tf sketch of the proof 

In Section |V] we prove some basic facts about "thermal errors" generated by the dissipative 
Lindblad dynamics. We define an energy barrier A(P) of a Pauli operator P as the smallest 
integer m such that P can be implemented by a stream of single-qubit errors that does not create 
more than m excitations during the process, see Section V A[ The detailed balance condition and 



locality of quantum jump operators in the Lindblad equation imply that thermal errors are unlikely 
to have a high energy barrier. More precisely, Lemma [T] in Section [V B| provides an upper bound 
on the storage error e(t) assuming that any error with energy barrier A(P) < m will be corrected 
at the decoding step. Here m is any fixed "energy cutoff'. The upper bound on e(t) involves 
the Boltzmann suppression factor e~^"^, and an entropic factor that grows exponentially with the 
volume of the lattice. We show that the entropic factor can be neglected in the regime Ne~^ < 1 
which yields a bound 

e{t) < 0(t)2^(^)A^e-^™, (9) 



see Section V C[ Clearly, this bound is only useful if m grows at least logarithmically with N. At 



this point we must invoke the special properties of the 3D Cubic Code. Our key technical ingredient 
is Theorem 2 of Ref. [7]. It asserts (loosely) that any Pauli operator P creating a cluster of defects 
that contains a "topologically charged" sub-cluster separated from its complement by distance R, 
has energy barrier at least A(P) > clogi? for some constant coefficient c. Choosing the energy 
cutoff m = c'logL for some constant c' < c, we can guarantee that a syndrome S{P) caused by 
any Pauli operator P satisfying A(P) < m, consists of connected "topologically neutral" clusters 
of defects, where the connectivity is defined using R ^ L'^ ^'^ as a, unit of length. The RG decoder 
will identify each of these clusters as a connected component of the syndrome for a sufficiently high 
RG level po ?a log2 R. Since the clusters are topologically neutral, the RG decoder will be able to 
annihilate each of them locally. This implies that the correcting operator Pec{S) will annihilate all 
defects in S. Finally, we invoke the "localization of high-level errors lemma" (Lemma 2 in Ref. [7]) 



to show that, in fact, Pec{S) coincides with P modulo stabilizers, see Section VII B Combining 
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this observation and Eq. ([9]) we prove Theorem [T] 

It should be noted that the full hierarchy of the RG decoder is not needed for the proof of the 
theorem; one can run the decoder at a single level {p = pq), and skip all the other levels. But, in 
practice, the hierarchical decoder performs better; it is able to correct errors with slightly higher 
energy barriers. Furthermore, the modified decoder using only one RG level would not have a 
constant threshold against local stochastic errors, see Appendix [Bj 



IV. PREVIOUS WORK 



Here we briefly review alternative routes towards quantum self-correction in topological mem- 
ories proposed in the literature. Most of them focus on finding new mechanisms for suppressing 
diffusion of topological defects (here and below we only consider zero-dimensional defects). Ar- 
guably, the simplest of such mechanisms would be to have no topological defects in the first place. 
Unfortunately, this seems to require four spatial dimensions. The 4D toric code [18] provides the 
only known example of a truly self-correcting quantum memory. As was shown by Alicki and 
Horodecki's [H], the memory time of the 4D toric code grows exponentially with the lattice size for 
small enough bath temperature. The first 3D topological memory in which diffusion of defects is 
constrained by superselection rules was proposed by Chamon [11] , see also [TT] . Topological defects 
in this model have a limited mobility restricted to certain subspaces of M'^ or have no mobility at 
all. However, the model has no macroscopic energy barrier that could suppress the diffusion. 2D 
topological memories in which diffusion of anyons is suppressed by effective long-range interactions 
were studied by Chesi et al [16] and Hamma et al [27]. A quenched disorder and Anderson local- 
ization were proposed as a means of suppressing propagation of defects at the zero temperature by 
Wootton and Pachos [46] and, independently, by Stark et al [M|, see also [10]. A no-go theorem 
for quantum self-correction based on 3D stabilizer Hamiltonians in which ground state degeneracy 
does not depend on lattice dimensions was proved by Yoshida [U]. A different, very promising, 
line of research initiated by Pastawski et al flOj focuses on quantum memories in which active error 
correction is imitated by engineered dissipation driving the memory system towards the ground 
state (as opposed to the Gibbs state). Finally, let us emphasize that quantum self-correction is 
technically different from the thermal stability of topological phases, see for instance |13ll291 [3 H l38j. 
While the latter attempts to establish the presence (or absence) of topological order in the equilib- 
rium thermal state, quantum self-correction is mostly concerned with the relaxation time towards 
the equilibrium state. 
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V. ANALYSIS OF THERMAL ERRORS 

We begin by setting up some terminology and notations. A ground state of the memory Hamil- 
tonian will be referred to as a vacuum. It will be convenient to perform an overall energy shift such 
that the vacuum has zero energy. A Pauli operator is an arbitrary tensor product of single-qubit 
Pauli operators X, Y, Z and the identity operators /. We will say that a Pauli operator creates m 
defects iff P anticommutes with exactly m stabilizer generators , . Equivalently, applying P 
to the vacuum one obtains an eigenvector of H with energy m. For example, using the explicit 
form of the generators, see Fig. [T| one can check that single-qubit X or Z errors create 4 defects, 
while Y errors create 8 defects. We will say that a Pauli error P is corrected by the decoder iff 
Pec{S{P)) = ^yPG, where S{P) is the syndrome of P, G is a product of stabilizer generators and 
7 is an overall phase factor. Let N = 2L^ be the total number of qubits. Note that is also the 
number of stabilizer generators. 

A. Energy barrier 

Let r = {Pq, Pi, . . . , Pt) be a finite sequence of Pauli operators such that the operators Pi and 
Pi+i differ on at most one qubit for all < i < We say that F is an error path implementing 
a Pauli operator P iS Pq = I and Pt = P. Let rrii be the number of defects created by Pi. The 
maximum number of defects 

m(F) = max 

0<i<t 

will be called an energy cost of the error path F. Given a Pauli operator P, we define its energy 
barrier A(P) as the minimum energy cost of all error paths implementing P, 

A(P) = niin m(F). 

Although the set of error paths is infinite, the minimum always exists because the energy cost is a 
non-negative integer. In fact, it suffices to consider paths in which Pi are all distinct. The number 
of such paths is finite since there are only finitely many Pauli operators for a given system size. 

It is worth emphasizing that an operator P may have a very large energy barrier even though 
P itself creates only a few defects or no defects at all. Consider as an example the 2D Ising model, 
H = —(1/2) ZuZy, where the sum runs over all pairs of nearest neighbor sites on the square 

lattice of size Lx L with open boundary conditions. Then the logical-X operator P = (S)u-^u has 
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an energy barrier A(P) = L since any sequence of bit-flips implementing P must create a domain 
wall across the lattice. 

It is clear that a Pauli operator P acting non-trivially on n qubits has energy barrier at most 
0{n). 



B. Lower bound on the memory time 

A naive intuition suggests that a stabilizer code Hamiltonian is a good candidate for being a 
self-correcting memory if there exists an error correction algorithm, or a decoder, that corrects all 
errors with a sufficiently small energy barrier. Errors with a high energy barrier can confuse the 
decoder and cause it to make wrong decisions, but we expect that such errors are unlikely to be 
created by the thermal noise. The following lemma makes this intuition more rigorous. Let / be 
the maximum energy barrier of Pauli operators that appear in the expansion of the quantum jump 
operators Aa^ui or Aa^u}Aa,uj, see Section 
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We will see later that A^^Ld act on a constant number 
of qubits (see Proposition |4] below) . It implies that / = 0(1). Below m is an arbitrary energy 
cutoff. 

Lemma 1. Suppose an error correction algorithm s i— )• Pec{s) corrects any Pauli error P with the 
energy barrier smaller than m + 2/. Let <I>ec he the corresponding decoder defined by Eq.^. Let 
Qm be the projector onto the subspace with at least m defects. Then 

e{t) = W^eMt)) - p(0)||i < 0{tN)TiQ„^e-^" (10) 

for any initial state p(0) supported on the ground subspace of H. The time evolution of p{t) is 
governed by the Lindblad equation, Eq. ([T]), with the inverse temperature (3 of the bath. 

Proof. We begin with the following simple observation. Let H be the linear map taking the 
commutator with that is, H{X) = HX — XH. 

Proposition 1. The map H commutes with the Lindblad generator C. The time-evolved state p{t) 
commutes with the memory Hamiltonian H. In addition, p{t) = e^*(/9(0)). 

Proof. S incG ^a^oj transfers energy uj from the memory to the bath, we have the following identity; 

Aa,u^H = HAa^^ + ujAa,,^ and A^^^H = HA^^^ - ujA^^^. (11) 
From this one easily gets HC = CH. It follows that 

p{t) = e-^*+^*(p(0)) = e^* o e-^*(p(0)) = e^*(p(0)). 
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since the initial state p(0) commutes with H. It also implies that p{t) commutes with H. □ 

Let T> = ker H be the set of all operators that are commuting with the Hamiltonian H. Every 
operator in D is block-diagonal in the energy eigenstate basis. Since p(0) is supported on the 
ground subspace of H, we have p(0) G D. Below we only consider states from D and linear maps 
preserving D. Define an orthogonal identity decomposition 

/ = n_ + n+ 

where n_ projects onto the subspace with energy < m + f and n_|_ projects onto the subspace 
with energy > m + f. Introduce auxiliary Lindblad generators 

^-(P) = {Ba,wpBl^^^ - ]j^{p,B\^^Ba,uA \ , where Ba,uj = Il-Aa,u> (12) 

and 

'^+'\P) = ^^K(^^'^)\^o,,upC\^ui-\{P^C]^,u,Ca,S\^ where C^,^ = n+A^,^. (13) 
Simple algebra shows that £_ and preserve T) and their restrictions on 2? satisfy 

£ = £_+£+. (14) 
It is useful to note that any X G D commutes with II-i-. By abuse of notations, we shall apply 



Eq.(14) as though it holds for all operators. 



Proposition 2. For any time t > and for any state pQ supported on the ground subspace of H 
one has 

$ec(e^-*(po)) = Po. (15) 

Proof. Since all maps are linear, we may assume po = \g) {g\ is a pure state. Then, e^~*(po) 
is in the span of states of form \ip) = H-En ■ ■ ■Ii-E2ll-Ei\g) , where Ei are Pauli operators 
that appears in the expansion of A^^^ or Aa^i^Aa,uj- This follows from the Taylor expansion of 
e^"*. Since Pauli errors map eigenvectors of H to eigenvectors of H, we conclude that either 
= 0, or lip) = En ■ ■ ■ E2E1 \g). Furthermore, the latter case is possible only if the Pauli operator 
E = En ■ ■ ■ E2E1 has energy barrier smaller than m + 2/. Indeed, definition of n_ implies that 
EjEj-i ■ ■ ■ El creates at most m + f — 1 defects for all j = 1, . . . , n. By assumption, each operator 
Ej can be implemented by an error path with energy cost at most /. Taking the composition of 
all such error paths one obtains an error path for E with energy cost at most m + 2/ — 1 and thus 
$ec will correct E. Since ^ec ° is a TPCP map, we must have ^'ec(e^~*(/Oo)) = Po- D 



16 



Proposition 3. For any decomposition C = £_ + £+ one has the following identity: 



+ / dse^-(*-*)£+e^^ (16) 





Proof. Use the identity 
ds 

□ 

Proposition 4 ([15]). Each operator A^^uj nets non-trivially only on 0(1) qubits. Furthermore, 

||£+|k.sup^±^ = 0(iV). 
X ll^lli 

Proof Let Aa{t) = e*-^*A„e-*^* such that A a^i^ is the Fourier component of Ao^^t) with the Bohr 
frequency cj, that is, Aa{t) = Yluj^~^^^^a,uj- Recall that each operator Aa acts on 0(1) qubits. 
Since H is a sum of pairwise commuting terms, we can represent Aa{t) as Aa{t) = e*^"*AQe~*^"* 
where Ha is obtained from H by retaining only those stabilizer generators that do not commute 
with Aa- All such generators must share at least one qubit with A^. Therefore Ao,(t) and Aa^^o 
act non-trivially only 0(1) qubits. Furthermore, since has 0(1) distinct integer eigenvalues, 
Aa{t) has only 0(1) distinct Bohr frequencies. The bound < 1 follows trivially from our 

assumption < 1. The norm of is then bounded from Eq. ^ using triangle inequality, 

ll^^^lli < ll^ll • ll^lli, and Eq. (|4]). □ 

Now we are ready to bound e{t). Let po = p(0). First, using Proposition [T] one gets p{t) = 
e^*(/9o)- Applying Propositions 2|3 



one arrives at 

e{t)< rds||e^-(*-^)£+e^^(po)||i <t- max ||£+ e^^(po)||i. (17) 

Jo 0<s<t 

We shall use an identity 

C+{X) = C+iQrnXQrr,) (18) 

valid for any X £ D. Indeed, any quantum jump operator A^^uj changes the energy at most by /, 
so that C+{Q:}^X) = Cj^{XQ^) = for any X £ V (note that any X £ V commutes with Qm). 
We arrive at 

e(t) <t - ||£+(Q„,e^^(/9o)Q„)||i <t - ||£+||i- ||g„e^^(po)Qm||i <0(tiV)TrQ„e^^(po), (19) 
where the maximization over s is implicit. Since the ground state energy of H is zero, one has 

Po = Zppppo, (20) 
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where is the partition function. It yields 

TrQ„e^^(po) = Tr po e^*^(Q„,) = Z^Ti pppQe'^''{Qra) = 2/3(po, e^*'(Qm))/3, (21) 

where we introduced the Liouville inner product {X,Y)ii = Trp^Xty. We denoted by C* the 
hnear map adjoint to C (the one describing time-evolution in the Heisenberg picture) with respect 
to the Hilbert-Schmidt inner product. The detailed balance condition Eq. ([s]) implies that the 
linear map C* is self-adjoint with respect to the Liouville inner product, 

{X,C{Y))p = {C{X),Y)p, (22) 

see [3] . It can be easily checked using the identities 

A-a,ujPp = e'l^"^ ppAa,uj and A^-u = A^^^^ (23) 



and Eq. (|3|. It follows that the map e'^** is also self-adjoint with respect to the Liouville inner 
product. Hence, we have 

1VQ„e^^(po) = Zp{po,e^*'{Q^))p = Zp{e^*%po),Qm)0 = TV e^*'^(po)Qme"''^ < TrQ^e"^^. 

(24) 

Here the last inequality follows from the fact that e^*'^ is a unital CP map and from po < I. □ 

C. Choosing the optimal number of qubits 

Let us fix the inverse temperature (3 and ask what is the optimal system size that minimizes 
the upper bound on the storage error e{t) derived in Lemma[T] The dimension of the subspace with 
exactly n defects equals 2^^^^ times the binomial N choose n coefficient since the total number of 
potential defects locations is N (recall that the 3D Cubic Code has exactly N stabilizer generators). 
Choosing any constant < a < 1 and setting k = k{L) we obtain 



n>m 



(25) 



The Boltzmann factor e~"-l^^ is responsible for the self-correcting behavior. It supports our initial 
intuition that errors with a high energy barrier (at least m) are unlikely to be generated by the 
thermal noise. On the other hand, the factor (1 -|- e~(^~")^)^ represents the entropy contribution. 
Loosely speaking, it takes into account the number of error paths with a given energy cost. The 
optimal choice of is determined by the tradeoff between the Boltzmann factor and the entropy 
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factor which depends on the scahng of the energy barrier m. For example, suppose one chooses 
N < e^^"")^ such that the entropy factor is upper bounded by a constant. Then Eq. (25) yields 



e(t) < 0{t)N2''e-''^"' for all N < e^i-*^)^. (26) 



In Section VII we will argue that the 3D Cubic Code (and any other stabilizer code having 
the topological order and obeying the no-strings rule as stated in [7J) admits an efficient error 
correction algorithm that corrects all errors with the energy barrier smaller than m = c log (iV) for 
some constant c > 0. For such codes one gets 

e{t) < 0{t) ■ 2^ ■ N^-""!^ for all N < e^^"'^)/^. (27) 

This is equivalent to the statement of Theorem [TJ 



VI. RENORMALIZATION GROUP DECODER 



Recall that the syndrome measurement reveals locations of defects (flipped stabilizer generators) 
created by an unknown error. The renormalization group (RG) decoder attempts to annihilate 
the defects comprising the syndrome S by dividing them into disjoint connected clusters S = 
Ci U . . . U Cm and then trying to annihilate each cluster Ca individually. More specifically, the 
decoder checks whether Ca can be annihilated by a Pauli operator Pa supported on a sufficiently 
small spatial region b{Ca) enclosing Ca- If such local annihilation operator Pa exists, the decoder 
updates the syndrome by erasing all the defects comprising Ca, records the operator P^, and moves 
on to the next cluster. If Ca cannot be annihilated, the decoder skips it. In general, the annihilation 
operator Pa might not be unique. However, if the enclosing region b{Ca) is small enough to ensure 
that no logical operator can be supported on 6(Ca), all annihilation operators Pa must be equivalent 
modulo stabilizers and the choice of Pa does not matter. 

After all clusters Ca have been examined, the decoder is left with a new configuration of defects 
S", which is typically smaller than the original one. If no defects are left, i.e., S' = 0, the decoder 
stops and returns the product of all recorded Pauli operators Pa- If 5' 7^ 0, the decoder applies 
a scale transformation increasing the unit of length by some constant factor and repeats all the 
above steps starting from the syndrome S' - The scale transformation potentially merges several 
unerased clusters Ca into a single connected cluster whereby giving the decoder one more attempt 
to annihilate them. 

The full decoding algorithm is the iteration of partitioning the defects into the connected clusters 
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and calculating the annihilation operators. It declares failure and aborts if the recorded operator 
cannot annihilate all the defects before the rescaled unit length is comparable to the lattice size. 

A detailed implementation of the RG decoder must be tailored to a specific lattice geometry and 
a stabilizer code under consideration. It must include a precise definition of the connected clusters 
of defects Ca and the enclosing regions h{Ca)- It must also include an algorithm for choosing the 
annihilation operators Pa, a schedule for increasing the unit of length, and clearly stated conditions 
under which the decoder aborts. In the rest of this section we describe an efficient implementation 
of the RG decoder for arbitrary 3D stabilizer codes satisfying topological order conditions defined 
in [7]. The only part of this implementation specialized for the 3D cubic code is the "broom 



algorithm" of Section VI E 



A. Metric convention and terminology 



Let A be the regular 3D cubic lattice of linear size L with periodic boundary conditions along 
all coordinates x,y,z. We shall label sites of A by triples of integers {i,j,k) defined modulo L 
and measure the distance between sites using the ^oo-metric. In other words, the distance d{u, v) 
between a pair of sites u and v is the smallest integer r such that u and v can be enclosed by a 
cubic box with dimensions rxrxr. For example, d{u, v) = 1 whenever u and v belong to the same 
edge, plaquette, or elementary cube of the lattice. Each site of A represents one or several physical 
qubits (two qubits for the 3D Cubic Code). Each elementary cube c represents a spatial location of 
one or several stabilizer generators (two generators for the 3D Cubic Code). A generator located 
at cube c may act only on qubits located at vertices of c. We shall label each elementary cube by 
coordinates of its center, the triple of half-integers (z, j, k) defined modulo L. The distance (i(c, c') 
between a pair of cubes c and c' is the distance between their centers. For example, (i(c, c') = 1 
whenever c and c' share a vertex, an edge, or a plaquette. 

A defect is a stabilizer generator whose eigenvalue has been flipped as a result of the error. We 
shall use a term cluster of defects^ or simply cluster for any set of defects. Define the diameter 
of a cluster d{C) as the maximum distance d[c, c') where c, c' E C. Here and below the distance 
between defects is defined as the distance between the cubes occupied by these defects. Given two 
non-empty clusters C and C , define a distance d{C,C') as the minimum distance d{c,c') where 
c £ C and c' G C . Given an integer r, we shall say that a cluster C is connected at scale r, or 
simply r-connected, if C cannot be partitioned into two proper subsets C = C U C" such that 
d{C', C") > r. A maximal r-connected subset of a cluster C is called a r-connected component of 
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C. The minimal enclosing box h{C) of a cluster C is the smallest rectangular box B enclosing all 
defects of C such that all vertices of B are dual sites of A. Note that the minimal enclosing box 
b{C) is unique as long as d{C) < L/2 (if a cluster C has diameter L/2, one may have two boxes 
with the same dimensions enclosing C that 'wrap' around the lattice in two different ways). 

B. Topological stabilizer codes 

The RG decoder can be applied to any stabilizer code satisfying topological order conditions 
defined in For the sake of completeness, we state these conditions below. Let Q be the Abelian 
group generated by the stabilizer generators. Elements of G will be called stabilizers. Let S{P) 
be the syndrome of a Pauli operator P, that is, the set of all stabilizer generators anticommuting 
with P. The syndrome can be viewed as a cluster of defects. 

Definition 1. A stabilizer code has topological order at scale Ltqo if any Pauli operator supported 
on a cube of linear size Ltqo satisfies the following conditions: 

(1) If P commutes with all stabilizers then P is stabilizer itself; 

(2) If S{P) 7^ then there is a stabilizer G G G such that PG is supported on the 1-neighborhood 
of the minimum enclosing box b{S{P)). 

A family of stabilizer codes {C{L)}l is called topological if there exists a constant 7 > such that 
each code C{L) has topological order at scale Ltqo ^ L'^. 

The 3D Cubic Code satisfies this topological order condition with Ltqo = ^L, see [26j. Strictly 
speaking Ltqo = L — 2 for the 3D Cubic code, but in order to avoid unnecessary complications due 
to boundaries, we always assume that Ltqo < ^L. Below we consider only topological stabilizer 
codes. 

Definition 2. A cluster of defects C is called neutral if it can be created from the vacuum by 
a Pauli operator P supported on a cube of linear size Ltqo- Otherwise, the cluster is said to be 
charged. 

For example, the 2D toric code [34J has two types of defects: magnetic charges (flipped plaquette 
operators) and electric charges (flipped star operators). A cluster of defects G is neutral iff G 
contains even number of magnetic charges and even number of electric charges. 

It follows from Definition [T] that any neutral cluster of defects G can be annihilated by a Pauli 
operator supported on the 1-neighborhood of the minimum enclosing box 6(C). 
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C. RG decoder 

We are now ready to define our RG decoder precisely. Recall that d{C) is the diameter of a 
cluster C, and Ltgo < by convention. 



TestNeutral 

Input S : a set of defects, Output P : a Pauli operator. 

1. Compute the minimal enclosing box B S. 

2. if d{B) > Ltqo, then return /. 

3. Try to compute a Pauli P supported on the 1-neighborhood of B such that S{P) = S. 

4. if a consistent P is found then return P else return /. 



The topological order condition of Definition [T] implies that TestNeutral computes the correcting 



Pauli operator for any neutral cluster. Step 1 is easy as we discuss in the end of Section VI D The 
specification of Step 3 depends on the code, but it always has an efficient implementation using 
the standard stabilizer formalism [22j. In general, the condition S{P) = S can be described by 
a system of 0{V) linear equations over 0(y) binary variables parameterizing P, where V is the 
volume of B. The running time is then O(y^) by the Gauss elimination. In the special case of the 
3D Cubic Code, there is a much more efficient algorithm running in time 0{V) which we describe 
in Section FVl Ei 

Let pm be the largest integer such that 2^'^' < min(|L, Ltqo)- For any integer < p < pm, we 
define the level-p error correction EC(p): 



EC(p) 

Input S : a syndrome. Output P : a Pauli operator. 

1. Partition S into 2P-connected components: 5 = Ci U . . . U Cm- 

2. For each component, compute Pa = TestNeutral(Ca). 

3. return the product P1P2 ■ ■ ■ Pm- 

The overall running time of EC(p) is polynomial in the number of qubits N . Step 1 can be 
done, for example, by examining the distance between all pairs of defects, forming a graph whose 
edges connect pairs of defects separated by distance < 2^, and finding connected components of 



this graph. A more efficient algorithm with the running time 0{N) is described in Section VI D 
Since V = 0{N) and m = 0{N), we see that the worst-case running time of EC(p) is 0{N'^). For 
instance, one can consider nested boxes, near the faces of which many defects lie. However, clusters 
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are created from the vacuum with some probabihty which we expect to be smaller than, say, ^. 
So the number of clusters that have overlapping minimal enclosing box appears with exponentially 
small probability. On average, the running time of EC(p) will be 0{N). 
The full RG decoding algorithm is as follows. 



RG Decoder 

Input S : the syndrome, Output Pec : a Pauli operator. 

1. Set Pec = I. 

2. for p = to pm do 

Let Q =EC{p){S). 

Update Pec ^ Pec- Q and S ^ S e S{Q). 
end for 

3. if = then return Pec else declare failure. 

Here the notation S(BS{Q) stands for the symmetric difference of the sets S and S{Q) or addition 
modulo two, if the syndromes are represented by binary strings. To conform with conventions 



made in Section II C , we may let the decoder on failure return an arbitrary Pauli operator that is 
consistent with the measured syndrome. Of course, this is no better than initializing the memory 
with an arbitrary state. The discussion above implies that the RG decoder has running time 
O(A^^logA^) in the worst case, and O(A^logA^) in the case of sparse syndromes. 



D. Cluster decomposition 



Given a length scale r, the cluster decomposition of defects is to partition the defects into 
maximally connected subsets (connected components) of the syndrome at scale r. Naively, the 
task to compute the decomposition of all the defects into clusters will take time O(m^) where m 
is the total number of defects. The density of defects will typically be constant irrespective of the 
system size, and the computation time for decomposition will be 0{N'^), where is the volume 
of the system. However, by exploiting the geometry of simple cubic lattice, we can do it in time 
0{N). This is the optimal scaling since we have to sweep through the whole system anyway to 
identify the position of defects. 

If r = 1, the problem is to label the connected components of binary array [HES]. Given a defect 
uq, we can compute the connected component containing uq in time 0{m) where m is the number 
of defects in the component. One prepares an empty queue (first-in-first-out data structure), and 
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puts uo into it. The subsequent computing is as follows: (i) Pop out the first element u from 
the queue, and of the neighborhood put the unlabeled defects into the queue and label them, (ii) 
Repeat until the queue becomes empty. Every defect in a connected component j is stored in 
the queue only once. Hence, this process computes the component j of a given defect in time 
proportional to the number nij of defects in j. One examines the whole system in some order and 
finds the connected component whenever there is an unlabeled defect. The total computation time 
is proportional to N + 0(1) J2j "^j = 0{N), since the connected components are disjoint. 

For r > 1, the algorithm begins by dividing the whole lattice into boxes of linear size r or 
smaller. The defects in a box certainly belong to a single connected component (recall that we use 
the Zoo metric). The defects in the boxes B, B' belong to the same component if and only if there 
is a pair u e B, v e B' of defects such that d{u,v) < r. In other words, we evaluate the binary 
function 



6iB,B') 



1 if there are u E B,v E B' such that d{u, v) < r, 
otherwise 



for each neighbor B' of B; if B does not meet B', we know that S{B, B') = 0. 

Given the table of S, we can finish computing the decomposition in time 0{{L/r)^) as in the 
r = 1 case. We show that the computation of S{B, B') can be done in time 0{r^) where D is the 
dimension of the lattice. Then, the total time to compute the table of 5 for all adjacent boxes will 
be 0{r^{L/r)^) = 0{N). Let B and B' be adjacent. For clarity of presentation, we restrict to 
D = 2. Suppose B and B' meet along an edge parallel to x-axis. Since any difference jx — .t'| of 
x-coordinates of the defects in BD B' is at most r, we only need to compare y-coordinatcs. That 
is, the problem is reduced to one dimension. It suffices to pick two defects from B, B' respectively 
that are the closest to the x-axis. If the y-coordinates differ at most by r, then S{B,B') = 1; 
otherwise, S{B, B') = 0. 

Suppose B and B' meet at a vertex. Without loss of generality, we assume B is in the third 
quadrant, and B' is in the first quadrant. Define a binary function 5' on B' as 



1 if there is a defect {x,y) G B' where x <i and y < j, 
otherwise. 



otherwise. 

The function table of S' is computed in time O(r^). It is important to note that S'{i,j) = 1 implies 
S'{i + = d'{i,j + 1) = 1. One starts from the origin and sets S'{^, ^) = 1 if there is a defect 
at ^); otherwise S'{^,^) = 0. Here, {^,^) means the elementary square in the first quadrant 



24 



that is the closest to the origin. Proceeding by a lexicographic order of the coordinates, one sets 
5'{i,j) = 1 if 5'(i — = 1, or 5'{i,j — 1) = 1, or there is a defect at otherwise S'{i,j) = 0. 

It is readily checked that this procedure correctly computes 6' . Equipped with this 5' table, we 
can immediately test for each defect in B whether there is a defect in B' within distance r. Thus, 
we have computed 6{B,B') in time O(r^) + 0{m) where m is the number of defects in B, which 
is at most O(r^). The computation of 5 in higher dimensions is similar. 

The computation of the minimal enclosing box for each cluster is also efficient. Given the 
coordinates of the m points in the cluster, we read out, say, x-coordinates xi, . . . , Xm- The minimal 
enclosing interval Bx of xi, . . . ,Xm under periodic boundary conditions, is the complement of the 
longest interval between consecutive points Xi and Xj+i which can be computed in time 0{m). B^ 
is unambiguous if the diameter of the cluster is smaller than L/2. The minimal enclosing box is 
the product set Bx x By x Bz, whose vertices are computed in time 0{m). 

E. Broom algorithm 

Now we describe an efficient algorithm for the 3D cubic code that tests whether a cluster is 
neutral. If the test is positive, the algorithm also returns a Pauli operator E that annihilates the 
cluster. Recall that the 3D cubic code has property that if the cluster is neutral, E is supported 
in the minimal box that encloses the cluster. This is because the erasure process [26] is possible as 
long as it does not encounter a defect. 

By the duality between Z- and X-type operator, we assume E is consisted entirely of Z errors. 
Since in the cubic code there is only one X-type of stabilizer generators, the positions of defects 
completely determines the X-type syndrome. We shall use double-letter indices to represent single- 
site Pauli operators; for example, IZ = I <^ Z, ZZ = Z Z, II = I <^ I , etc. Let the qubits have 
integer coordinates, and the defects have half-integer coordinates. 

Elementary syndromes are depicted in Fig. |4j We fix a box B that encloses all the defects in 
the neutral cluster. We will sweep the defects to bottom-left-back corner. Since each defect is 
a Z2 charge, they will disappear in the end. The algorithm begins with the top-right foremost 
vertex of B on the dual lattice. If there is a defect at {x + + ^, z + we apply ZI at (x, y, z) 
to eliminate it. This might create another defects as there are four defects in the elementary 
syndrome. Important is that the potentially new defects are all contained in the box B we started 
with. Continuing with ZI we push all the defects in the foremost plane of B to the left vertical 
line and the bottom horizontal line. During this process, we record where ZI has been applied. 
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FIG. 4. Elementary syndromes created by Z errors. The vertices which are on the dual lattice, represent 
the defects created by the error at the center. The elementary syndrome by ZI is used to push the defects 
to the bottom and to the left. The syndromes by errors of weight three is used to push the defects to the 
bottom-left corner. 



For the defects on the vertical line at the left or on the horizontal line at the bottom, we use the 
operator of weight 3 to further move the defects to the bottom-left corner. See Fig. |4j This will in 
general create more defects behind, all of which are still contained in B. Thus, we have moved all 
the defects on the foremost plane to the bottom-left corner except for the three sites i, u, v: 

t o (28) 

u — V 

That is, if E' is the recorded operator during the sweeping process, the syndrome S{EE') C B has 
potential defects only at t, n, v on the foremost plane. 

Let o be at [xq + \-,yo + \, Zq + \)- By considering the multiplication by suitable stabilizer 
generators , we can assume that EE' is the identity on the plane x = Xo, except {xo,yo, Zo)- 
Since there is no defect at o, the operator at {xq, Uo, Zq) has to commute with XX; it is either II or 
ZZ. Applying ZZ if necessary, the operator at (xq, Vo, Zq) will become //, and the defects at t, u, v 
will disappear. In this way, we have successfully pushed all the defects to the next-to- foremost 
plane. We emphasize that the box B still envelops all the defects, and further B can be shrunk in 
one direction. 

Due to the threefold symmetry of the cubic code, one can carry out this broom algorithm along 
any of three directions. We will have at last a box B of volume 1 that encloses all defects. The 
defects in the cluster must be from one of the three elementary syndrome cubes created either by 
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Z/, /Z, or ZZ^ which are easily eUminated. It is clear that in time O(l^) the error operator has 
been computed up to stabilizer, where V is the initial volume of the minimal enclosing box of the 
cluster. 

Note that a similar sweeping algorithm is applicable to the toric code family for any dimensions. 

VII. CORRECTABILITY OF ERRORS WITH A LOGARITHMIC ENERGY BARRIER 

Let P be an unknown Pauli error. Suppose we are promised that P has a sufficiently small 
energy barrier, namely, A(i-') < clog L, for some constant c that will be chosen later. In this section 
we prove that any such error P will be corrected by the RG decoder for any topological stabilizer 
code satisfying the no-strings rule. For completeness, we define the no-strings rule formally. 

A. No-strings rule 

Let P be any Pauli operator supported on a cube with linear size at most Lfgoj and let S = S{P) 
be its syndrome. Suppose S can be partitioned into a pair of disjoint clusters, S = Ci U C2, such 
that d{Ca) < r and d{Ci^C2) > ar for some r > 1 and a > 1. We call such P a string segment 
with aspect ratio a. Intuitively, a string segment with a large aspect ratio, a ^ 1, is an operator 
capable of creating a pair of well-separated clusters of defects from the vacuum. We say P is 
a trivial string segment if the clusters of defects Ci , C2 are neutral (note that the full syndrome 
S" = Ci U C2 is always neutral due to our assumptions on P). Throughout this section we only 
consider topological stabilizer codes following Definition [l} Let a > 1 be any constant. 

Definition 3. A family of topological stabilizer codes obeys the no-strings rule with a constant a 
if any logical string segment with the aspect ratio greater than a is trivial. 

The 3D Cubic Code obeys no-string rule with a = 5 under the current ^oo-metric convention |26] . 

B. Errors with a logarithmic energy barrier 

In order to assess the capability of the RG decoder rigorously, we will need some of the results 
from Ref. [7j. Assume throughout this section that a family of topological stabilizer codes {Cl}l 
with Ltqo > L'^ obey the no-strings rule with some constant a. Define 

a?) = {War 

for notational convenience. 
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Definition 4 ([7]). A syndrome S is called sparse at level p S can be partitioned into disjoint 
clusters, 5 = Ci U . . . U C„, such that 

d{Ca)<aP) and d{CaUCb)>^{p+l) 

for all a ^ b. Otherwise, S is called dense at level p. 

(The term 'dense' should not be confused with the 'dense subset' of a topological space, with 
which any nonempty open set intersects; the two notions are even conceptually different. Here, the 
denseness is just the negation of sparseness.) 

Proposition 5 (Lemma 1 of [7J). If S is dense at all levels q = 0, . . . ,p then S contains at least 
p + 2 defects. 

Let r = (Pq, Pi, ... , Pf) be an error path implementing P with the energy cost m(r) = m, see 
Section V A Here Pq = I, Pt = P, while Ej = PjPj-i are single-qubit Pauli operators for all j. A 



sequence of syndromes S{j) = S{Pj), j = 0, . . . ,t is called a level-0 syndrome history. Note that 
S{0) = and S{t) = S. For any integer p > 1 define inductively a level-p syndrome history as a 
subsequence of {5'(j)}j=o,...,t including only S'(O), S{t) and all syndromes S{j), < j < t which 
are dense at all levels q = 0, . . . ,p — 1. Proposition [5] implies that there is a level Pmax < m — 1 
such that the level-pmax syndrome history contains only the initial empty syndrome 5(0) = and 
the final syndrome S{t) = S. The key technical result is the following. 

Proposition 6 (Lemma 2 of [7J). Let S' and S" be any consecutive pair of syndromes in the level-p 
syndrome history. Let E be the product of all errors Ej that occurred in the error path T between 
S' and S" . If Am{2 + ^{p)) < Ltqo then there exists a stabilizer G such that E ■ G is supported 
on the ^{p) -neighborhood of S' U S" . 

This is useful to characterize a low energy barrier error. 

Lemma 2. Let P be any Pauli error, S = S{P) be its syndrome, and m = A(P) be its energy 
barrier. Suppose 

8m(10a)"' < Ltqo. (29) 

Then, there exists a stabilizer G £ G such that P ■ G has support on the (^{m) -neighborhood of S. 
Any R-connected component of S is neutral for 2^{m) < R< 4^(m). 
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Proof. Let us apply Proposition [6] to the level Pmax , the smallest integer such that the syndrome 
history at that level has only the initial and final syndrome. Since Pmax < the condition 
4m(2 + i{pmax)) < Ltqo is Satisfied. We have S' = 0, S" = S", and E = P. Hence, there exists a 
stabilizer G £ G such that P ■ G is supported on the ^(pmaa;)-neighborhood of S. It proves the first 
statement of the lemma. 

Let r = ^{m) = (lOa)"^. Choose any R such that 2r < R < Ar and let Ca be any i?-connected 
component of S. Since Ca contains at most m defects, the diameter of Ca is at most mR. Restricting 
P • G on the r-neighborhood of Cq, we obtain a Pauli operator Pa supported on a cube of linear size 
at most mR + 2r < Arm + 2r < Ltqo by assumption. Furthermore, the support of Pa is separated 
from (Pa)^^(P • G) by distance at least i? — 2r > 0. Hence, Pa creates the cluster Ca from the 
vacuum. Therefore, Ca is neutral. □ 

We wish to have a well-separated cluster decomposition. 

Lemma 3. Let S he any cluster ofm > defects. For any integer /u > 1, there exists a nonnegative 
integer p < m and a decomposition S = CiL) ■ ■ - DCn such that d{Ca) < and d(Ca, C^) > 2-4P^ 
for all a ^ b. 

Proof. Let us say that a partition of S into clusters is /i-good if it satisfies the statement. By 
grouping all defects occupying the same elementary cube into a cluster, one obtains a partition 
S = CiU . . .UCg. Obviously, g < m, and d{Ca) < ^. If this partition is not 0-good, then g >2 and 
there is a pair, say, Ci, C2 such that d(Ci, C2) < 2^. Merging Ci and C2 into a single cluster C2, 
one obtains a partition 5 = C2 U C3 U . . . U Cg where ^(C^ < A^. If this partition is not 1-good, 
then g >2> and one can repeat the merging again. After at most g — 1 iterations, one arrives at a 
good partition. □ 

Note that the minimal enclosing boxes of distinct cluster do not overlap, since 

d{b{Ca), b{Cb)) >2-APf,- 4V - 4^^ = 0. 

The following is the promised property of the RG decoder. 

Lemma 4. Let P be any Pauli error with energy barrier m = A(P). Suppose 

(80a)™ < Ltqo- 

Then calling the RG decoder on the syndrome S{P) returns a correcting operator P^c such that 
PPec is a stabilizer. 
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Thus, the RG decoder corrects P if A(P) < jpg^lo^) ^ogL. 

Proof. Let S = S{P) be the syndrome. Let p be the integer such that 2^(m) < 2^ < 4^(m). 
Setting ;U = 2^' in Lemma [sj S is decomposed into S = Ci U . . . U C„ such that d{Ca) < 2^ and 
d(Ca, Cb) > 2P'+i for ah a^b, where p' is an integer such that p < p' < 2m + p. By Lemma [2| each 
Ca is neutral for being a disjoint union of neutral 2P-connected components. The RG subroutines 
EC(s) with s = 0,l,...,p — 1, can only annihilate some neutral 2'^-connected components of 
Ca, which does not alter the neutrality of Ca- Therefore, the RG decoder from level-0 to p will 
annihilate each cluster Cq, and hence S at last. 

We need to show that P ■ P^c is a stabilizer, where Pec is the returned correcting operator. Let 
Ba be the (10a)™'-neighborhood of b[Ca)- Our assumptions imply that Ba has diameter smaller 
than Ltqo and distinct BaS do not intersect. By construction, the operators Pec and P ■ G have 
support in the union i?i U • • • U B^. Therefore, P ■ G ■ Pec = Qi - ■ ■ Qn, where Qa has support on 
Ba and has trivial syndrome. Topological order condition implies that Qa are stabilizers, so is the 
product. □ 

As was mentioned earlier, the full hierarchy of the RG decoder is not necessary to correct the 
error with the low energy barrier. A single level-p error correction with p proportional to log Ltqo, 
will be sufficient. We nevertheless include the hierarchy since in practice it corrects errors with 
slightly higher (although only by a constant factor) energy barrier at a marginal slowdown of 
the decoder. If one wishes to apply the decoder against random errors, the hierarchy becomes 
necessary. We discuss the character of the RG decoder against random errors in Appendix. 



C. Further specialization 



A closer analysis reveals a simplification of TestNeutral defined in Section VI C for the 3D Cubic 
Code. We defined TestNeutral to return the identity operator if a cluster turns out to be charged. 
The modified TestNeutral' just applies the broom algorithm and returns recorded operator in any 
case. It gives the same characteristic as stated in the Lemma |4j EC'(p) using TestNeutral' will 
transform a charged cluster Ca to a different cluster C'a, but C'a is still contained in b{Ca)- Due to 
Lemmajsj b{Ca) do not overlap at a high level p, and EC'(j») will eliminate neutral clusters at last. 
This specialized version of RG decoder is used in our numerical simulation of the next section. 
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VIII. NUMERICAL SIMULATION 

We test our theoretical bound by numerical simulations. The interaction of the memory system 
with a thermal bath is simulated by Metropolis evolution. As we wish to observe low temperature 
behavior we adopt continuous time algorithm by Bortz, Kalos, and Lebowitz (BKL) [6]. A pseudo- 
random number generation package RngStream by L'Ecuyer [35] was used. As before, the coupling 
constant in the Hamiltonian is set to J = 1/2 so a single defect has energy 1. Although the 3D 
Cubic Code is inherently quantum, it is relevant to consider only X-type errors (bit flip) in the 
simulation, thanks to the duality of the X- and Z-type stabilizer generators of the 3D Cubic Code. 
The simulation thus is purely classical. The errors are represented by a binary array of length 2L^, 
and the corresponding syndrome by a binary array of length . 

The memory time is measured to be the first time when the memory becomes unreliable. There 
are two cases the memory is unreliable: either the broom algorithm fails to remove all the defects 
so we have to reinitialize the memory, or a nontrivial logical error is occurred. It is thus necessary 
in our simulation to keep track of the error operator during the time evolution. In fact, most of 
the time, it was the broom algorithm's failure that made the memory unreliable. Nontrivial logical 
errors occurred only for very small system sizes L = 5,7. 

It is too costly to decode the system every time it is updated. Alternatively, we have performed 
a trial decoding every fixed time interval 




where /3 is the inverse temperature. Although the time evolution of the BKL algorithm is stochastic, 
a single BKL update typically advances time much smaller than Tec- So it makes sense to decode 
the system every Tec* The exponential factor appears naturally because BKL algorithm advances 
time exponentially faster as /? increases. It is to be emphasized that we do not alter the system by 
the trial decodings (a copy of the actual syndrome has been created for each trial decoding). 

The system sizes for the simulation are chosen such that the code space dimension is exactly 
2, for which the complete list of logical operators is known. If the linear size L is < 200, this is 
the case when L is not a multiple of 2, 15, or 63 [26j. For these system sizes, to check whether 
a logical operator is nontrivial is to compute the commutation relation with the known nontrivial 
logical operators. 

The measured memory time for a given L and /3 is observed to follow an exponential distribution; 
a memory system is corrupted with a certain probability given time interval. Specifically, the 
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probability that the measured memory time is t is proportional to e"*/"^. Thus the memory time 
should be presented as the characteristic time of the exponential distribution. We choose the 
estimator for the characteristic time to be the sample average T = ^X]?-^*- '^^^ deviation of 
the estimator will follow a normal distribution for large number n of samples. We calculated the 
confidence interval to be the standard deviation of the samples divided by ^/n. For each L, 400 
samples when f3 < 5.0 and 100 samples when f3 > 5.0 were simulated. The result is summarized in 

Fig.m 

Figure [3] clearly supports log Tmem = ^cP"^ + ■ ■ ■ . Figure [2] demonstrates the power law for small 
system size: 

71 r 2.93/3-10.5 

-'-mem, ^ -'^ 

We wish to relate some details of the model with the numerical coefficients. The Rigorous analysis 
of the previous section, gives a relatively small coefficient c of the energy barrier for correctable 
errors by our RG decoder. However, we expect that the coefficient of /3 in the exponent is the same 
as the constant c that appear in the energy barrier 

E = c log2 R 

to create an isolated defect separated from the other by a distance R. This is based on an intuition 
that the output P' of the decoder would have roughly the same support as the real error P for 
the most of the time, provided that the error has energy barrier less than A = c\og2Ltqo- Thus, 
an error of energy barrier less than A would be corrected by the decoder. Our empirical formula 
supports this intuition. It suggests that c = 2.93 log 2 = 2.03 ~ 2. 

Indeed, we can illustrate explicitly an error path that separates a single defect from the rest 
by distance 2^ during which only 2p + 4 defects are needed. This is an improvement in the upper 
bound on the energy barrier for separating a charged set of defects, for which it was c < 4 in [7]. 

Consider an error of weight 2 that creates 4 defects as shown in the top of Fig. [5] We call it 
the level-0 hook. The bottom sequence depicts a process to create a configuration shown at the 
bottom-left, which we call level-1 hook. One sees that level-1 hook is similar with ratio 2 to level-0, 
and is obtained from level-0 with extra 2 defects. One defines level-p hooks hierarchically. We 
claim that a level-p hook can be constructed from the vacuum using 2p + A defects. The proof is 
by induction. The case p = 1 is treated in the diagrams. Suppose we can construct level-p hook 
using 2p + A defects. Consider the 2"'^, 4*^^, 6*^, and 8*'^ steps in Fig. [5| They can be viewed as 
a minuscule version of level-p steps that construct a level-(p -|- 1) hook from the level-p hooks. It 
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FIG. 5. Construction of a hook of level 2 from the vacuum. The grid diagram represents the position and 
the number of defects in the {x = 2;)-plane. For each transition, an operator of weight 1 is applied. The 
total number of defects never exceeds 6. From a level-0 hook (the second diagram in the sequence), a level-1 
hook (the last in the sequence) is constructed using extra 2 defects. 



requires at most 2p + 4 + 2 defects to perform the level-p step; this completes the induction. It may 
not be obvious whether a high level hook corresponds to a nontrivial logical operator, but such a 
large hook is bad enough to make our decoder fail. 
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cubic code (right) under independent random bit-flip 
D toric) = 6.7(1)% and Pc(3D cubic) > 1.1%. 



Appendix A: Benchmark of the decoder 



Given a decoder, a family of quantum codes indexed by code length (system size) is said to 
have an error threshold pc if the probability for decoder to fail approaches zero in the limit of large 
code length provided the random error rate p is less than pc- We tested our decoder with respect 
to random uncorrelated bit-flip errors on the well-studied 2D toric code. The error threshold is 
measured to be 6.7(1)%. See Fig. [6| It is roughly two-thirds of the best known value 10.3% based 
on the perfect matching algorithm |18|. I28j. or 9% based on a renormalization group decoder of 
similar nature to ours |19j . This is remarkable for our decoder's simplicity and applicability. The 
3D cubic code has threshold > 1.1% under independent bit-flip errors. 



Appendix B: Threshold theorem for topological stabilizer codes 



In this section we prove that any topological stabilizer code can tolerate stochastic local errors 
with a small constant rate assuming that the error correction is performed using the RG decoder. 
We assume without lose of generality that each stabilizer generator is supported on a unit cube. 
Each site of the lattice may contain finitely many qubits. A generator at a cube c may act only 
on qubits of c. We shall assume that errors at different sites are independent and identically 
distributed. More precisely, let E{P) be the set of sites at which a Pauli error P acts non-trivially. 
We shall assume that 



Pr[^(P) = E] = {1- e)^-l-^lel-^l 



(Bl) 
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where < e < 1 is the error rate and V = is the total number of sites (the volume of the 
lattice). For example, the depolarizing noise in which every qubit experiences X, Y, Z errors with 



the probability p/3 each, satisfies Eq. (Bl) with the error rate e = 1 — (1 — p)'^, where q is the 
number of qubits per site. 

Theorem 2. Suppose a family of stabilizer codes has topological order, see Definition [7J Then, 
there exists a constant threshold eo > such that for any e < eq the RG decoder corrects random 
independent errors with rate e with the failure probability at most e"^^^''^ for some constant rj > 0. 



In the rest of this section we prove the theorem. Our proof borrows some techniques from 
2T \ I23 | |28]. specifically Section 5.1 of [23j. For reader's convenience, we briefly summarize the RG 



decoding algorithm below (see Section VI for details). 

Recall that we use ^oo-metric. A cube of linear size r thus has diameter r. Let S = S{P) be the 
syndrome of a Pauli error P considered as a set of defects (violated stabilizers). A subset M S 
is called r-connected iff M cannot be partitioned into a pair of disjoint proper subsets separated 
by distance more than r. A maximum r-connected subset of S is called an r-connected component 
of S. A cluster of defects M C 5 is called neutral if it can be created from the vacuum by a Pauli 
operator P supported in a cube of linear size Ltqo- The smallest rectangular box enclosing a cluster 
M will be denoted h{M). Any neutral cluster M can be created by a Pauli operator supported on 
the 1-neighborhood of b{M), see Definition [T] 

The level-p error correction EC(p) on a syndrome S is the following subroutine, (i) find all 
neutral 2P-connected components M of S, (ii) for each M found at step 1, calculate and apply a 
Pauli operator P supported on the 1-neighborhood of b{M) that annihilates M, and update the 
syndrome accordingly. Calling the full RG decoder on a syndrome S involves the following steps: 
(i) run EC(0), EC(1), EC([log2 itgoj ), (ii) if the resulting syndrome S is empty, return the 
accumulated Pauli operator applied by the subroutines EC(p). Otherwise, declare a failure. 

Below we shall use the term 'error' both for the error operator P and for the subset of sites E 
acted on by P, whenever the meaning is clear from the context. Let us choose an integer Q ^ 1 
and define a class of errors which are properly corrected by the RG decoder, see Lemma [5] below. 
We will see later that this class of errors actually includes all errors which are likely to appear for 
small enough error rate. 

Definition 5. Let ii^ be a fixed error. A site u ^ E is called a level-0 chunk. A non-empty subset 
of E is called a level-n chunk (n > 1) if it is a disjoint union of two level-(?7. — 1) chunks and its 
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diameter is at most Q"^ /2. 

The term 'chunk' is chosen in order to avoid confusion with 'cluster', which is used for a set 
of defects. Note that a level-n chunk contains exactly 2" sites. Given an error E, let be the 
union of all level-n chunks of E. If u G E^+i, then by definition u is an element of a level-(n + 1) 
chunk. Since a level- (n -|- 1) chunk is a union of two level-n chunks, u is contained in a level-n 
chunk. Hence, u G En, and the sequence En form a descending chain 



where m is the smallest integer such that Em+i = 0. Let Fi = Ei\ Sj+i, so -E = i^o U i^i U • • • U F^ 
is expressed as a disjoint union, which we call the chunk decomposition of E. 

Proposition 7. Let Q >6 and M he any Q^-connected component of Fn- Then M has diameter 
< and is separated from En \ M by distance > |Q"^^. 

Proof. We claim that for any pair of sites u £ Fn = En \ En+i and v £ En we have d{u, v) < 
or d{u,v) > Suppose on the contrary to the claim, that there is a pair u £ Fn and v £ En 

such that < d{u,v) < Let Cu ^ u and 3 v he level-n chunks that contains u and 

V, respectively. S incG th.6 d-ictmctcrs of Cu ^ are < (5"/2 and d{u,v) > Q", we deduce that Cu and 
C„ are disjoint. On the other hand, 



since Q > 6. Thus, Cu U C^, is a level-(n + 1) chunk that contains u which shows that u G En+i- 



Note that in the chunk decomposition a Q"-connected component P of En may not be separated 
from the rest E\P hy distance > Q^. 

Lemma 5. Let Q > 10. // the length m of the chunk decomposition of an error E satisfies 
Qm+i ^ Lfgo, then E is corrected by the RG decoder. 

Proof. Consider any fixed error P supported on a set of sites E. Let E = FqU FiU ■ ■ ■ U Em be 
the chunk decomposition of E, and let Fj^a be the Q-' -connected components of Fj. Also, let Bj^a 
be the 1-neighborhood of the smallest box enclosing the syndrome created by the restriction of P 
onto Fj^a- Proposition [7] implies that 



E = Eq^ Ei^ ■■■^ E, 



d{Cu U a) < d{u, v) + d{Cu) + < Q"+V2 



It contradicts to our assumption that u £ Fn = En\ En+i- 



□ 




(B2) 
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iv) 

Let Pec be the accumulated correcting operator returned by the levels 0, ... ,p of the RG decoder. 
Let us use induction in p to prove the following statement. 

1. The operator Pec has support on the union of the boxes Bj^^. 

(p) 

2. The operators Pec and P have the same restriction on Bj^a modulo stabilizers for any j 
such that 2P>Q^ + 2. 



the syndrome S{P) is fully contained inside some box Bj^a- It proves that Pec^ has support on the 

(p) 

Suppose we have proved the above statement for some p. Then the operator P-Pec has support 



The base of induction is p = 0. Using Eq. (B2) we conclude that any 1-connected component of 
the syndrome S{P) is fully contained inside some box It pro' 
union of the boxes Bj^a- The second statement is trivial for p = 
p: 

only inside boxes Bj^^ such that 2^ < + 1 (modulo stabilizers). It follows that any 2P"^^-connected 

(p) 

component of the syndrome caused by P ■ Pec is contained in some box Bj^a with 2^ < + 1. 
Note that the RG decoder never adds new defects; we just need to check that 2P"''^-connected 
components do not cross the boundaries between the boxes Bj^a with 2^ < + \. This follows 



from Eq. (B2). Hence Pec^^^ has support in the union of Bj a- Furthermore, if 2^ < Q-' + 1 < 2'^^^, 



the cluster of defects created by P • Pic^ inside Bj^a forms a single 2P+^-connected component of 
the syndrome examined by EC(p + 1). This cluster is neutral since we assumed Q"^^^ < Ltqo- 
Hence Pec^^'^ will annihilate this cluster. The annihilation operator is equivalent to the restriction 
of P ■ Pec^ onto Bj^a modulo stabilizers, since the linear size of Bj^a is smaller than Ltqo- It proves 
the induction hypothesis for the level p + 1. □ 

The preceding lemma says that errors by which the RG decoder could be confused are those 
from very high level chunks. What is the probability of the occurrence of such a high level chunk 



if the error is random according to Eq.(Bl)? Since our probability distribution of errors depend 
only on the number of sites in E, this question is completely percolation-theoretic. 

Let us review some terminology from the percolation theory [24j. An event is a collection of 
configurations. In our setting, a configuration is a subset of the lattice. Hence, we have a partial 
order in the configuration space by the set-theoretic inclusion. An event £ is said to be increasing 
if E £ £,E <^ E' implies E' € £. For example, the event defined by the criterion that there exists 
an error at (0,0), is increasing. The disjoint occurrence A o B the events A and B is defined 
as the collection of configurations E such that E = Ea U Eb is a disjoint union of Ea €z A and 
Ef, E B. To illustrate the distinction between Ao B and ACiB, consider two events defined as ^ = 
"there are errors at (0,0) and at (1,0)", and B = "there are errors at (0,0) and at (0,1)". The 
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intersection ACi B contains a configuration {(0, 0), (1, 0), (0, 1)}, but the disjoint occurrence Ao B 
does not. A useful inequality by van den Berg and Kesten (BK) reads 



Fr[AoB]<Fr[A]-Fr[B] (B3) 
provided the events A and B are increasing. 



Proof, (of Theorem[2]) Consider a I?-dimensional lattice and a random error E defined by Eq. (Bl ) 



Let Bn be a fixed cubic box of linear size and be the box of linear size 3Q" centered at B^- 
Define the following probabilities: 

Pn = Pr [Bn has a non-zero overlap with a level-n chunk of E] 
Pn = Pr [B^ contains a level-n chunk of E] 
Qn = Pr [B^ contains 2 disjoint level-(n — 1) chunks of E] 
Vn = Pr [B^ contains a level-(n — 1) chunk of E] 

Note that all these probabilities do not depend on the choice of the box Bn due to translation- 
invariance. Since a level-0 chunk is just a single site of E, we have po = e. We begin by noting 
that 

Pn<Pn< Qn- 

Here we used the fact that any level-n chunk has diameter at most and that any level-n 

chunk consists of a disjoint pair of level- (n — 1) chunks. Let us fix the box B^ and let Q„ be the 
event that B^ contains a disjoint pair of level-(n — 1) chunks of E. Let TZn be the event that B^ 
contains a level-(n — 1) chunk of E. Then Q„ = IZnolZn- It is clear that Qn and IZn are increasing 
events. Applying the van den Berg and Kesten inequality we arrive at 

Qn < rl. 

Finally, since B^ is a disjoint union of {2>Q)^ boxes of linear size Q"'~^, the union bound yields 

Tn < (3g)V-i- 

Combining the above inequalities we get pn < {3Q)'^^p1_i, and hence 

Pn<{3Qr'^{{3Qf^er. 

The probability pn is doubly exponentially small in n whenever e < (3(5)^^^. If there exists at 
least one level-n chunk, there is always a box of linear size Q"^ that overlaps with it. Hence, on 
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the finite system of linear size L, the probability of the occurrence of a level-m chunk is bounded 
above by L^pm- Employing Lemma [5} we conclude that the RG decoder fails with probability at 
most Pfaii = L^Pm for any m such that Q'"^'^'^ < Ltqo- Since we assumed that Ltgo > L'^i one can 
choose m w 7 log L/ log Q. In this case pfaii = exp (—Q{L'^)) for 7/ w 7/ log Q. We have proved our 
theorem with eo = (3(5)^^^ where Q = 10. □ 
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